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A MODIFIED  SRIF  FILTER 
USING  LDL'  FACTORIZATIONS 


INTRODUCTION 

The  problem  of  estimating  a set  of  parameters  or  variables  from  a set  of  measurements 
has  long  been  of  interest.  Kalman  in  the  early  sixties  provided  a simple  recursive  estimation 
procedures  by  introducing  the  concept  of  state  and  state  transition.  This  procedure  in  some 
instances  provided  simpler  implementation  than  batching  techniques.  Since  Kalman's  work  a 
number  of  numerical  procedures  have  been  developed.  An  excellent  account  of  these  pro- 
cedures as  well  as  historical  notes  can  be  found  in  Bierman’s  book  (11.  Two  basic  techniques 
described  in  Ref.  1 are  the  square-root  information  filter  (SRIF  filter)  and  the  UDU  filter 
(where  UDU  should  actually  be  UDU’,  with  U being  an  upper  triangular  matrix  and  D being  a 
diagonal  matrix).  In  both  cases  the  data  are  prewhitened  using  Cholesky  factorization.  The 
SRIF  filter  is  then  based  on  the  Householder  transform.  The  UDU  filter  is  based  on  a Chole- 
sky factorization  of  the  smoothed  covariance  update  using  one  measurement  at  a time  and  the 
modified  Gram  Schmidt  for  updating  the  predicted  covariance.  These  numerical  techniques  are 
claimed  to  have  better  numerical  stability  than  direct  use  of  the  Kalman  filter  equations  and 
may  be  more  amenable  to  hardware  implementation. 

In  Ref.  1 the  SRIF  filter  described  used  a Cholesky  factorization  of  the  covariance  matrix 
of  the  form  of  a triangular  matrix  times  the  transpose  of  the  same  triangular  matrix.  This 
report  is  concerned  with  reformulating  the  SRIF  filler  in  terms  of  a Cholesky  factorization  of 
the  covariance  matrix  using  a triangular  matrix  times  a diagonal  matrix  times  the  transpose  of 
the  same  triangular  matrix,  where  the  diagonal  elements  of  the  triangular  matrix  are  1.  The 
reason  for  changing  the  factorization  is  to  attempt  to  remove  some  of  the  square-root  opera- 
tions. 

The  following  section  briefly  reviews  the  Kalman  filter.  The  SRIF  filter  is  next  derived 
using  the  alternate  factorization.  The  properties  of  a modified  Householder  algorithm  which  are 
necessary  for  the  SRIF-filter  mechanization  are  shown.  The  prediction  process  is  shown  for  the 
case  of  no  process  noise,  and  an  exponential  time  weighting  into  the  past  is  incorporated. 
Finally  the  entire  algorithm  is  written  out  and  examined. 

MODIFIED  SRIF  FILTER 

The  SRIF  filter  is  a numerical  method  of  implementing  the  Kalman  filter  (1).  The  Kal- 
man filter  is  obtained  from  modeling  the  process  as  state  equations,  defining  a measurement 
procedure,  and  best-estimating  the  states  of  the  systems.  The  state  equation  and  measurement 
process  are  defined  as 

*(*)  - <t»(Ar)  X(k  - 1)  + r(Ac)  W{k) 


XM(k)  - H(k)  X(k)  + V(k), 


Manuscript  submilled  February  21,  1971. 


_■  sr*:-'-' 


B.  H.  CANTRELL 


< 


where  it  is  desired  to  best-estimate  the  n-by-1  state  vector  X(k).  The  remaining  quantities  are 
n-by-n  state  transition  matrix  4>(A),  an  n-by-p  matrix  I'(A),  an  m-by-n  measurement  matrix 
H(k),  and  an  m-by-1  measurement  vector  XM(k ),  with  M '(A)  and  ^(A)  being  Gaussian  noises 
with  the  following  properties: 

E\\V(k)\  -0, 

E^W(k)  W'(J)\  “ S(k)  8,*. 

fjf'fA)]  -0, 

E^V(k)  vO)j  - Q(k)  8m, 
and 

E\w(k)  V(j)\  « 0. 
in  which 

8,a  “ 1 when  j — k and  0 otherwise. 


The  covariance  matrices  S(k)  and  Q(k)  are  of  dimension  p by  pand  m by  m,  respectively. 

The  best  estimate  of  X(k),  denoted  by  X(k)  in  the  standard  Kalman-filter  format,  is 

X(k ) - X(k)  + K (A)  [j|fM(A)  - //(A)  Jf(A)|.  (1) 

where  K (A)  is  the  filter  gain,  given  by 


K(k)  - P(k)  H'(k)  Q Hk). 

In  equation  (2)  P(k)  is  the  smoothed  covariance  matrix,  given  by 

P '(A)  - P '(A)  + H'(k)  0 '(A)  //(A). 

In  equation  (3)  P(k)  is  the  predicted  covariance  matrix,  with 

P(k  + D-  <t>(A  + 1)  P(k)  <P  (k  + 1)  + F’(A  + 1) 
S(k  + 1)  T(A  + 1). 

The  prediction  is 

X (A  + 1)  - <t»( A + 1)  X(k). 


(2) 

(3) 

(4) 

(5) 


The  filter  operates  in  a predicl-and-correcl  fashion.  This  suggests  the  following  simple  deriva- 
tion. 


Equation  (1)  is  the  least-square  estimate  between  the  prediction  and  the  measurement  at 
the  Ath  sample,  which  is  obtained  by  minimizing  the  cost  function 


7(A)  - [jf(A)  - JT(A)]'  P'Hk)\x(k)  - X(k) 
+ \xM(k)  - HX(k)\'Q  '(A)  |jfm(A)  - HX(k) 


(6) 


with  respect  to  X(k).  The  value  of  A' ( A ) which  minimizes  X(k),  denoted  by  X(k),  is  the  best 
estimate  of  X(k)  and  is  given  in  equations  (1)  through  (3).  When  the  best  estimate  of  X(k) 
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is  given,  the  best  prediction  is  simply  equation  (5)  with  the  covariance  of  (4).  The  process  is 
then  simply  repeated  recursively  with  equations  (4)  and  (5)  being  the  prediction  and  equations 
(1)  through  (3)  being  the  correction. 

The  SRIF  filler  is  a means  of  implementing  the  Kalman  filter  which  depends  heavily  on 
Cholsky  decomposition  and  the  Householder  matrix  triangulation  algorithm  [1],  The  Cholsky 
decomposition  is  performed  on  a symmetric  positive-definite  matrix  by  factoring  it  to  the  pro- 
duct of  a lower  triangular  matrix  L and  its  transpose: 

Q - L L' 
and 

Q~'  - L-'. 

The  algorithm  for  obtaining  L , found  in  Ref.  1,  is 

ln  “ J " 1.  » - 1. 

K,  “ dkA,r  k “ i + 1 "* 

and 

<t,k  ~ ~ *Af  * ” J + 1.-.  » and  / - k n. 


In  this  report  the  LDL'  factorization  described  in  Ref.  1 is  used  to  construct  the  SRIF- 
like  filter  rather  than  using  the  L L'  factorization.  The  algorithm  for  obtaining  the  LDL’  fac- 
torization is 

Q - L D L' 
and 

Q ' - D 1 L \ 

where 

In  - 1.  J - 1 n - I, 

d / - «//• 

A,  - <?*//«*,.  k - j + 1 n. 

and 

q,k  - Q,k  ~l,Aidn  k -7  + 1 n and  i - k n. 

The  matrix  L is  lower  triangular  in  form  and  has  diagonal  elements  of  1.  The  matrix  D is  a 
diagonal  matrix  with  diagonal  elements  of  dr 

The  derivation  of  a SRIF  filter  using  the  LDL'  factorization  proceeds  as  follows.  The  cost 
function  in  equation  (6)  can  be  written  as 

J - (X  - XY  R D R (k  - X)  + (X„  - HXY 

(I/)-'  V L~HXU-  HX),  (7) 
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where  the  k denoting  sample  number  has  been  dropped  for  notational  convenience,  P~'(k)  is 
factored  into  R D R\  and  Q(k)  is  factored  into  L DM  L'  (so  that  Q'(k)  - (i')-1  D m L~]). 
Equation  (7)  can  be  rewritten  as 


J - (Z  - R'X)'  D(Z  - R'Z)  + ( ZM  - HWXY  Dmx(Zm  - HWX), 


(8) 


where 


and 


Z - R X, 
ZM  - L~'  X, 


H w - L 'H. 

Equation  (8)  can  be  rewritten  more  compactly  as 


J - 


R' 
H w 


X - 


Z 

ZM 


D 0 


0 D~l 


R' 

Hw 


X - 


Z 

Zm 


(9) 


With  the  notation 


D 


D 0 
0 Dm' 


the  cost  function  is  unaltered  if  an  orthogonal  transform  T,  where  TD  T — D,  is  multiplied  by 
the  new  resulting  vector  in  equation  (9).  Consquently  use  of 


R’ 

Hw 


X - 


Z 

Zm 


in  J - C'  D C yields  the  same  cost  as 


J - C'T'D  T C. 

In  addition,  if  T,  which  is  a n-plus-m  square  matrix,  is  chosen  such  that 

T 

and 

T 

the  cost  then  becomes 


m ' 

R' 

R' 

Hw 

0 

Z 

Zm 


J - (R'X  - Z)'  D(R'X  - Z)  + e'Du'  e. 
By  inspection  the  least-square  estimate  of  X is 

R'X  - Z,  or  X - (*  )-'  Z, 


(10) 

(ID 

(12) 

(13) 


r 
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the  minimum  value  of  the  cost  J is  e'DM]e,  and  the  smoothed  covariance  is  P(k)  = R'D  R. 
For  simplification  equation  (10)  is  augmented  to  equation  (11)  yielding 


^ ' 

R' 

z 

R 

z 

Hw 

zM 

0 

e 

(14) 


Later  it  will  be  shown  that  the  smoothed  covariance  factor  R does  not  have  all  l’s  on  the  diago^ 
nal  elements  as  the  factors  R and  L did.  This  can  be  modified  by  relating  the  factors  R and  D 
to  new  factors  R and  D , where  R has  diagonal  elements  of  1.  This  can  be  shown  by  rewriting 
the  cost  of  equation  (12)  as 

J - (X  - XY  R D R'  (X  - X)  + e'DM'  e. 


Since  the  smoothed  covariance  can  be  rewritten  as  P - R D R ' or  P = R D R,  the  cost  func- 
tion can  be  rewritten  with  either  way  unaltered.  The  relation  can  be  found  by  noting  the  2-by-2 
example 


'll  0 

d\  0 

'll  '21 

'21  '22 

0 d2 

0 '22 

which  can  be  rewritten  as 


1 0 

'll  0 

d\  0 

'11  0 

1 '21/ 'll 

'2|/'ll  1 

0 '22 

0 d2 

0 '22 

0 1 

Combining  terms  in  equation  (IS)  yields 


1 0 

d\'\\  0 

1 '2l/'ll 

'2l/'ll  J 

0 d2rl 2 

0 1 

This  is  the  desired  factored  form  R D R'.  The  algorithm  can  easily  be  generalized  to  yield 

d,  — d,  r„2,  / — 1 n,  (16) 

and 

r„  ~ rjr,,.  j - i,  ....  n.  (17) 


The  least-square  estimate  of  X in  equation  (13)  is  given  by  R'X 
the  form 

R'X  - Z. 


Z,  which  is  also  altered  into 
(18) 


By  example  again,  equation  (13)  can  be  written  as 


*\ 

'll  '12 

_ Z, 

° 7j2 

*1 

|z-2 
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which  can  be  written  as 


1 'V'n 

'll  0 

Z\ 

0 Tu 

0 1 

*2 

2 2 

which  in  turn  can  be  rewritten  as 


1 ^2|/Z|  1 

*1 

Z|/Z|| 

0 1 

*2 

Z2/'r22 

Equation  (19)  is  the  desired  form  (18),  where 

z,  - z,/r,„  / - 1 n. 


(19) 

(20) 


As  a summary  of  the  preceding  results,  the  transform  T is  applied  to  the  augmented 
matrix  equation  (14)  and  the  desired  factored  form  is  obtained  by  applying  equation  (16),  (17), 
and  (20).  We  next  consider  how  the  matrix  transform  T which  triangulates  a matrix  with  the 
property  T D'T  — D is  obtained. 


MODIFIED  HOUSEHOLDER  TRANSFORM 

The  elementary  Householder  transform  given  in  Ref.  1 is  obtained  as  follows.  Let  the  vec- 
tor U be  normal  to  the  plane  (A.  An  arbitrary  vector  Y can  be  represented  by 

Y = ( Y U)  0 + V,  (21) 


where  U — U/JV'U  and  V is  that  part  of  Y that  is  orthogonal  to  U.  The  reflection  of  Y , 
denoted  by  Y„  in  the  plane  U is 

Y,  - — ( K* 0)0+  V (22) 

and  is  represented  in  Fig  1.  Eliminating  Ffrom  equations  (21)  and  (22)  yields 

Y,  - Y - U - (/  - p UU')  Y - TY,  (23) 


T-  'I'J**-'  ■ ■ "" 


I 


* 


i 
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where  p is  defined  as  p - 2/6"  DU.  The  matrix  T is  an  elementary  Householder  transform 
with  properties  T = T'  and  T 2 = /,  and  it  can  be  shown  to  triangulate  a matrix. 

To  obtain  the  desired  results  required  in  the  previous  section,  the  dot  products  of  equa- 
tion (23)  are  modified  by  a diagonal  matrix  D to  yield 

Y,  = Y - 2 | U - (/  - pUU'D)  Y - TY.  (24) 

Four  properties  are  given  and  proved.  Property  1 is 

7”  — T (T  is  symmetric). 

Since,  from  equation  (35), 

T'-U  - 0 UU'DY 
= (/  - pD'UU') 

- (/  - pDUU'). 

the  proof  that  T — T requires  showing  that  DUU'  — (/f/'O.  If  Z>£/6"  — (/6"Z>,  then 

U'(DUU')  U = U'(UU'D)U 


or 


which  is  an  identity.  QED. 
Property  2 is 


(U'DU)  (U'U)  = (f/'6)  (U'Dt/), 


r DT  - D. 


Since 

r/)7=  (/  - p UU'DY  DU-  pUU'D ) 

-(/  - PDUU  ) DU-  PUU'D) 

- (D  - pDUU'D)  U - PUU'D) 

- D - 1PDUU  D + P2DUU'DUU'D 

- D - 2pDu\u'D  - (p/2)  U'DUU'd], 
the  proof  requires  showing  that 

U'D  - (p/2)  U' DUU' D - 0 


or  that 


U'D  - 


2 (U'DU) 


U'DUU'D  - 0 


or  that 


U'DUU'D  - U'DUU'D  - 0. 


which  is  an  identity.  QED. 
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To  express  properties  3 and  4,  the  following  notation  is  introduced: 

d\  0 0 0 

Y'DY  . n 0 <0  o 0 
where  D * 0 0 ...  0 

1 0 0 0 d„ 


in  which 

M|  - y\  + <r, 

«2  - y-i, 

“}  - y* 

and 

Property  3 is 

p U’  DU  <1X<TU\ 

which  is  proven  by  writing 

2 2 

U'DU  (y,  + it)2  rf,  + yl  d 2 + ...  + y2d„ 

2 

2y\irdt  + <r2d\  + Ofai  + y 2^  2 + + yndn) 

2 

2y,rr<f|  + <r2(/|  + tr2d\ 

1 1_ 

dttrly,  + it)  d[<ru\ 

QED. 

Property  4 is 


y,i 

— cr 

TY  - 

y,  2 

- 

0 

•Vrn 
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The  proof  is  as  follows.  From  equation  (35) 

rr-r-^u 


- Y - 


- Y - 


>,(>,  + a)d\  + y]d2  + ...  +yj/d„  U 
d |(T  U ! 

(yi^d,  + (r^d j)  U 


- r - —+■  --  u 

*1 


- Y - U 


y i 
y-i 

y\  + <* 
V2 

. 

— O’ 

0 

Yn 

y n 

6 

QED.  We  will  use 


such  that 


7T-->'  + jIlw(/ 


so  as  to  have  the  convenience  of  postive  diagonal  elements  in  the  resulting  triangularized 
matrix.  Properties  3 and  4 can  be  used  to  triangulate  a matrix  by  applying  them  on  successive 
columns  of  the  matrix. 

The  following  example  illustrates  the  use  of  the  algorithm: 


*u 

*12 

*13 

*-| 

*12 

*13 

*21 

*22 

*23 

0 

*22 

*23 

*31 

*32 

*33 

0 

*32 

*33 

*41 

*42 

*43 

0 

*42 

*43 

*u  +<7, 

U - °21 

*31 

o-,  - sgn  auyJ(a^d^  + a^d2  + a}td}  + 
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and 


in  which 


and 


b0 a ,j  + Pyuu,,  j - 1 3 and  / = 1,  ...  4. 

fi  - — - — 
d\(TU\ 

Yu  “ + a2ju2  + a2uu2  + ahu3  + a4/u4. 


The  process  is  repeated  for  each  successive  submatrix.  The  next  step  is 


where 


<T|  *12  *13 

O’]  *12  *13 

1 0 

0 *22  *23 

0 0-2  c23 

0 t2 

0 b}2  b 33 

0 0 C33 

0 *42  *43 

| 0 0 C43 

U 


*22  +<x 

bn 

bn 


<t2  - sgn  (b22)  y/(b$2d2  + b$2d}  + bl2djld2. 


and 

c,j b,j  + Py2jUj_lt  j - 2.  3 and  / - 2,  ...  4, 

in  which 


d2(TU ! 


and 

-y2;  ” b2jU\  + *3jM2  + b4juy 

In  summary,  it  was  shown  that  the  modified  Householder  algorithm  meets  the  require- 
ments of  the  modified  SRIF  filter  (7"D  T - D,  with  T triangulating  the  matrix).  A simple 
means  of  obtaining  the  prediction  portion  of  the  SRIF  filter  under  an  important  case  is  next 
considered. 


PREDICTION  PROCESS 

The  smoothing  portion  of  the  Kalman  filter  using  a modified-SRIF-filter  implementation 
updates  the  factorization  of  the  smoothed  covariance  and  the  transformed  best  estimate.  It  is 
desirable  to  update  the  prediction  process  in  a commensurable  form.  Only  an  important  special 
case  is  considered:  The  process  noise  IF(fc)  is  assumed  to  be  zero,  and  the  state  transition 
matrix  is  assumed  to  be  in  the  upper  triangular  form.  Equation  (4)  updating  the  predicted 
covariance  then  becomes 

P -<b  P V,  (25) 

where  the  noise  W(k)  is  removed  and  the  sample  k has  been  dropped  for  notational  conveni- 
ence. The  inverse  of  (25)  is  taken,  yielding 


\ 


l 


I 
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f P~l  - P~'  4>-'.  (26) 

l>  The  covariances  are  replaced  with  their  factorizations 

R D R’  - R D R' 

which  can  be  rewritten  as 

R D R - (<t')_l  R D(<t>-'RY. 

Since  (4>') _l  R is  of  the  lower  triangular  form,  then 

R - (4>')_l  R (27a) 

and 

D-D.  (27b) 

which  shows  the  simple  form  of  updating  the  factors  of  the  prediction  covariance. 

The  predicted  state  given  by 

X -<t>  X 

from  equation  (7)  is  transformed  by 

(R')-‘  Z - <t>(R')-’  Z. 

where  X — (7?')-1  Zand  X — (R')~l  Z.  Solving  for  Zyields 

Z - (R  ) 4>(R')-' Z.  (28) 

Substituting  R from  (27a)  into  (28)  yields 

Z - Z. 

The  transformed  smoothed  and  predicted  states  are  seen  to  be  identical. 

Sometimes  it  is  desirable  to  implement  a fading-memory  filter  by  making  the  smoothed 
covariance  larger.  This  is  accomplished  by  rewriting  equation  (26)  as 

P ' - (<!>') -*  aP  ' d>-'. 

The  parameter  is  a scalar  representing  a time  fading  by 

a — e~^J, 

where  r is  the  time  constant  and  I is  time.  Equation  (27b)  is  modified,  and  equations  (27a) 
and  (27b)  now  are 

R - (*’)-'  R 
and 

D — yfa  D, 

and  these  equations  now  remain  the  same  under  the  fading-memory  condition. 

A functional  flow  of  the  filter  using  the  special-case  prediction  is  shown  in  Fig.  2.  The 
measurement  and  measurement  matrix  are  first  transformed  using  the  factorization  of  the 
measurement  covariance  matrix.  For  zero  process  noise  and  upper-triangular-form  state  transi- 
tion matrices,  the  prediction  process  is  given  simply.  The  transformed  predicted  and  smoothed 
states  are  the  same,  and  the  covariances  are  related  by  a simple  transform.  The  smoothing  on 
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the  filter  is  given  by  the  modified  Householder  transform  previously  described,  and  the  terms 
are  rearranged  to  obtain  a more  desirable  form.  This  completes  the  filter  operation  except  for 
forming  the  smoothed  estimate  and  the  covariance  if  desired  from  the  transformed  state  and 
the  covariance  factors. 

SUMMARY 

The  SRIF  filter  is  a numerical  means  of  solving  the  Kalman-filter  equations.  In  this 
report  the  SRIF  filter  was  modified  to  accommodate LDL  factorizations  of  the  covariances  rather 
that  the  usual  LL  factorizations.  This  factorization  required  that  the  Householder  algorithm  b>- 
modified  and  required  a small  manipulation  of  the  results  after  transforming  Under  a special 
case  the  prediction  process  was  shown  to  be  quite  simple  using  the  LDL'  factorization 

The  reason  for  investigating  the  SRIF  fiter  using  LDL'  factorizations  was  to  try  to  elim- 
inate the  square-root  operations  found  in  the  SRIF  filter.  Although  the  square-root  operations 
using  LDL'  factorization  now  do  not  appear  in  the  prewhitening  process,  they  still  are  necessary 
in  the  modified  Householder  algorithm.  Consequently  the  results  are  not  as  strong  as  was  first 
hoped  for.  An  interesting  result  does  occur  in  the  use  of  a fading  memory:  Only  the  diagonal 
elements  D of  the  LDL'  factorization  need  be  weighted  rather  than  the  entire  covariance 
matrix. 
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